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NOTHER EDUCATIONAL CAMPAIGN 
] b ki 
1 ccount of the use of paign principles in 
called “The n \ lin kl 
ed we limited r discuss to progr 
ler certain conditions. The condit s chosen wert 
e recognize DV the c llective term ot campaign 
hove nt ned re] on) is ch 
rst, by recog! ed need « desire on the pat ot 
raAnize - second, by definite time limits, parti 
hi cl ( rd. by l defin ely orgal d ettort t 
nd sustain interest \ fourth characteristic rests upon 
t the success or failure of a campaign is determined 
duals other than thos ho organize it, whether it be 
s in the political compaign, converts in the rel ! 
r subscribers to the fund. Furthermore, the compaign 


vilezes of co-operation whether civic, social, moral, or 
king that repo1 Wwe stated the belief that the plan would 
’ 


four fundamental processes 1n 


numbers. We are now ready to state that the plan ts 
nly just as feasible in the subject of spelling as in arithmetic 
the J Educat I h, 2:481, June, 192 
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ist as feasible in the formal or n nical phases of any el : 
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THe SPELLING CAMPAIGN 
The Pupils—T wenty classes in fourth, fifth, and sixth ¢: 
in two buildings, were each assigned to one of two grou 
of these groups was to be campaigned, and was designate 
experimental group; the other group, which was to tak 
itial and final tests, but was not to be campaigned, w 
the control group. Each group consisted of ten classes, ar 


> i 


groups were estimated by principals to be of appr 
equi | ability 
The Material—Ten lists of fifty words each, with 


tive phrases, were used for tests. Each list was made uy 


first fifty words in columns H to Q inclusive of the Ayres 


The Plan—Teachers were furnished a copy of the d 
on alternate mornings. They gave the test by reading 
trative prase containing each word, and then repeating t! 
to be written in list form by the pupils. 

] 


The papers from the ten classes in the experimental 
were corrected the day they were written and individual list 
misspelled words were returned to the teacher to be giver 
pupils for study in the spelling period the next day. The 
of this group were also told that the test would be rep: 
exactly a month. No lists were returned nor announce: 


made to the classes in the control group. 


TABLE I PERCENT OF CORRECT SPELLINGS GRAD! 


EXPERIMENTAL Cont 
Ayres’ List STANDARD Grot 
First Second First ~ 
H 9S 96 95 
| On 91 
J O4 06 ag 
K 92 97 84 
88 90 93 | 4 
M S4 83 O4 73 
N 79 93 67 71 
() 73 73 91 5S f 
66 63 16 14 
5S 5S 79 14 ‘ 
Tor Al S4 93 73 


| 
| 
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E-XPERIMENTAI ( 
STANDARD G 
Firs 
a7 OS Q7 
1 9] ay St) 
70 7s ou 
73 79 SY iS 
St) 
Ii! PERCH r OF CORRECT SPELLINGS GI \ 
l STANDARD GROUP GRO 
First Second birst 
I 100 OS a7 
100 OS Gs 
ON ON 17 
| OH Q5 ON Q3 
‘ On 9] 
9] 97 sO so 
SS 4 St) 
() S4 70 Sb sO) 
93 7 
took a school month to give the ten tests on alternate days 


next month the tests were repeated in the same order (1 


reasing difficulty, H to Q) and with same time intervals except 
the first five were given on succeeding days instead of on 


lternating days, thus using three weeks for the second tests 
Each spelling period on no-test days was devoted to regular 
by the control-group classes, and to study of individual 


spellings by experimental-group classes. Mimeographel cop- 
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od) and «al 


and phrase lists were also furnished pupils 


up to use 1f they wanted to review the entire 


[ Table sh Ws the percent 


in each list of words made by the experimental 


in each trial. Tables II and III give th 
respectively. 

[V is a summary of results. Part (a) gives t 
perimental groups and part (b) the data for tl 


s. A comparison of the two parts of the tabl 


ol groups to have been somewhat poorer spellers 


ts and that their gain was quite negligible in co: 


vear s work between fourth and fifth grades is 


f the experimental groups. 
SUMMARY AND CONCLUSIONS 
eX ental groups the increase in percent of 


i 
33.400 words by the fourth-grade pupils 
of 48,050 words by fifth-grade pupils 
; and of 30,050 words by sixth-grade pupils 
These percent increases of 9, 5, and 4 wer 
f a school year in regular spelling periods 
nt increase in the Ayres scale on the same | 


een fifth and sixth grades is 4.8 percent 


TABLE IV. SUMMARY OF RESULTS 


a Experimental Groups 
NUMBER OF CAIN 
VEER OF Nt MBER OF Worps 
P Worps MISSPELLED Number 
ACH I RIAI 
First Second 
33400 5527 2312 3215 
961 48050 4552 | 2327 2225 
601 30050 12063 1045 1018 
2230 111500 12142 5684 6458 
ISSPELLED 11 § 29° 


7 
trol 
T ble 
| 
te 
In the 
| S4 to 9g 
| 
to 
moon ; ft] 4 
mean pere te 
\ 19 
VI 
Percent M | 
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EDUCATIONAL CAMPAIGN 


NUMBER OF MissPELLED 


SPELLED Trial of 
RIA 
TRIAI 
First Second 
652 320600 TOLLS 1] 
lie 5346 1008S 138 
Sv] $1050 | 107 
2245 112250 17458 15642 Isl 
M ELLED 16 14 8° 
‘ words g pe 


IV) of their misspellings from the central halt « 


list in one-fifth of a year’s time simply because they 


where they stand, and that they will have a chan 
he value of their efforts, are we not overlooking a 
he subjects, or divisions of a subject, which lend t 


definite tests of dev elopment ? 
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ils of the intermediate grades will eliminate 53 pet 
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RECENT DEVELOPMENTS IN SILENT-READI? 


ne 
nus an ¢ | ent 1 superior to what we had tw 
In particular, aavancement has been made adiony 
) n the testing procedure ; (2 
\ CULLIA types ort mate! cli, 
‘ { mnrehencns 
ypes OF comprehension 
|? \ \DEQUATE EQUIPMENT OF SILI 
READING TEsTs 
s of sulent-reading tests —The most set 
ing tests and scales now available for group testing 
' ( hed » three tvpe is follows 
( gray tests with units of equal t 
| Silent Revised 
ip ent S I, 2 
Pete Vill vi 
~ vit < oT is © 
Me Rea Scale (te 
t ] (one I } 
Il, tor g es VI to x11 (tw 


\ S IV, a \ 
i il ‘ 
4 


Types 1 and 2 are alike in that the pupil may re-read 


raph unit until he thinks he can indicate a correct 


| TESTS 

| RK. S 

| = 

: | rdyce’s Scale for Measuring Achievement in R 

t for gr eS III and Ivy | 
lest tor grades v and VI | 
lest for the Junior High School 

piri? 
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response. In this respect, all these tests measure a 
f comprehension ability. Those of Type 3 measure 
f continuous reading and the abilitv of the pupil t 
points in a single reading The kind of reading 

( nd the character r the ¢ prehension measured 


ly different from that used in the tests of Types 1 


Gita 
ests (Revised ) and the Burgess Scales are 
certain fundamental characteristics, as indicated in the 
itline. thev are different in certain respects ne 
units in the Burgess Scales are all of the same type 


1 while in anv one of the Monroe tests there are 


ational, and poetical units The 


ptive mrorla 

lan is in accord with the present tendency in attempt 
eas ability to read ecific tvpe of material with 

7 

ic 
ever, the Burgess scoring plan limits n aterially the use 
f the scales fhe Monroe scoring plan shows rate in 
WW ds read per munute and comprenens on in te s ol 
| 


ses. This makes it possible to gray 


for school and easily to determine the relative efh 
: rate and c my] rehension In the Burgess test, the 
re depends upon the number of correct responses, and 
re in no wav reveals the pupil's relative rapidity and 
Consequently, it is impossible to graph the results 


| and thus to show the relative efficiency as to speed 
racy 

Burgess scores range from 0 to 100 according to the 
of paragraphs read successfully. The fourth grade must 
uccessfully seven paragraphs to obtain the standard 


re of 50, while the fifth grade is required to read eight 


iphs successfully to secure the standard median score 
The requirement is similarly increased for each succeed- 
ide. This plan has several disadvantages in comparison 


the simpler plan used by Monroe—+. e., that of allowing a 


orehension score of one for each unit read successfully. The 
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Burgess plan « plicates the hing of the results 
It difhcult the locating of 
7 ? + 
I el of kes difficult the « 
ot 
le expe et with the Mz nroe 
Heves that the useruiness of 
ci na tot 1p rw be mat tally 
the number of units re 
( eas na the percent of these units wil 
Lie ¢ pl eCnsion Sure I 
plan OF st Ing, the | il or the school that has a Ml 
] hey ] nnoreher 
prenet n ( cy We Vn as a percen we s 


I as now 1} nd the class 
teacher needs to know about the class and the individ 
is the relative efficiency in rate and accuracy and 1 


should reveal this without the necessity of referring t 


papers or to tedious comparisons with standar« 


a mations.—About the same time that Burgess had « 


that d ty or power tests are not applicable in m« 
reading ability McCall had decided that the difficulty « 

test is the best measure of reading ability. The units 
Thorndike-McCall Scale are progressive in difficulty. T] 
paragraph is relatively easy, the next somewhat more d 


and soon. The paragraph sections of the Haggerty tests { 
the same plan. The two differ, however, in the types 
ponse required. Haggerty follows the plan of requiri: 


pupil to select from a number of listed responses. TI 
lends itself to economy of time and to a high degree of 
in checking and scoring. The Thorndike-McCall plan ts t 


e answers to questions. This plan | 


the pupil write out th 
tain disadvantages. The checking of the answers 
consuming and there is danger of inaccuracy on the part 


grader. Furthermore, this plan limits the test very largel 


type of question that tests the pupil’s ability to get cert 
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| d consequently reduces to a minimum the use 
ns that test the pupil's grasp of the major relational 


— 
~ 


agral 
cost of testing—The Monroe and the Burgess 


relatively inexpensive. The Thorndike-McCall tests are 


) t more expensive, and the Haggerty tests are very ex- 
Testing material has been relatively more expensive 

hing material because testing material can be used but 

ile teaching material in the form of books and pamphlets 

ised over and over. In the case of the Stone Series of 

e-Reading Tests, while the initial cost is relatively high, 


hooklets and rate cards may be used over and over 


itive-reading tests —Besides measuring the ability of a 


do the study type of paragraph reading, the supervisor 
sroom teacher need to be able to measure his ability 
the main points in a narrative at a single reading and 

ble to measure accurately the rate at which the whole 
is read. The Stone Series of Narrative-Reading 
; designed for these purposes. At present there is avail- 
st for the third and fourth grades, one for the fifth and 
ides, and one for the junior high-school grades. Addi- 
les are in preparation. It is planned eventually to have 
each grade with two forms so that it will be possible 
re attainment and progress in any grade with a story 
iculty level of that grade. This means that the teacher 

h grade will have a testing equipment on narrative read- 

will enable her to diagnose her incoming class and to 

: re progress after a certain period of instruction. This 
use of testing equipment the same as we regularly use 
ling book, the writer conceives to be essential to the most 


type of teaching. 


~ 


II will give a detailed account of the construction and 
of the Narrative Reading Test for the Junior High 


shed by The Public School Publishing Co., Bloomington, IIL. 
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Part Il Tue Stone SERIES OF NARRATIVE-READING T 


Difficulties involved in constructing a measure of na 
a Recent reading tests have been | 
test It much ea to measure ability to read parag 
than ability to read a complete narrative. It 1s very ¢ 
dopt le | edure in a |] graph test, while it 
Lirip ble to do na ve test Ina speed es 
yraph reading, in such tests as the Monroe or the Burg 
time taken to do the ly IS v short, while it ts 
impossible t Live I ( ( n less t e than it 
e a power ¢ ditheult test 1 paragr ph readin 
the problem of obt ng an accurate measure 
ch terms as the number of rds per minute 
serious one in devising narrative tests. Gray invented 
ccurate means of finding the pupil's rate of reading, but 
Ives individual testing; and this is too time-consu! 
regular classroom purposes On the other hand, cla 
teachers and supervisors have become very skeptical ot 
monly used plan of having the pupils read a minute 
t nal mark the last word read 
\nother problem ts so to relate the rate and « 
te that ( have a rate measure and a ¢ pre ( 


neasure based on the same reading act. Only when this is 


can we satisfactorily characterize the reader as slow and 


ate, rapid and accurate, slow and inaccurate, or rapid 


accurate. No group narrative test printed previous to 
olves this pt blem 


Construction of the Junior High School Test.—In tl 
struction of the Junior High School Test a number « 
liminary experiments were tried, an account of which it is 
possible to give in this brief article. The following prin 
were kept in mind from the beginning 

a. The rate test should include for all pupils the read 


ing of the whole narrative 


b. The test for the upper grades should be based upon 


a narrative of upper-grade level. 


| 
| 
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The comprehension test should measure the pupil's 
rasp of the leading points of the narrative 
th the rate test and the comprehension test should 
as simple in the testing procedure as possibl 
[he narrative should be of such length that no pupil 


uld complete it in less than two minutes 


first story tried was too easy; but a passage fron 
s “The Spy” proved to be of eighth-grade level, and it 
ted. This narrative passage is about 1400 words in 
During the spring of 1921, eighth-grade pupils to the 


4 words 


f 563 were tested. The median rate was 2 


minute. On the comprehension test of ten questions, 

was 64, ten points being allowed for each question 

nswered. This extensive trial of the test followed 

ry trials and revisions. By this time the questions had 

und to be approximately equal in difficulty. However, 

tructing diagnostic charts for interpreting results it was 

it ten questions did not give a sufficiently refined scale 

early in the work of constructing the test it was de- 

have preparatory rate and comprehension tests preced 

real tests. The preparatory tests are carried out in all de 

» the real tests. The unit of reading is, of course, much 

[he rate test is conducted as follows. The pupils are told 

they are to read the narrative through from beginning to 

|, reading as rapidly as they can and at the same time get the 

Each pupil understands that he is not to look up until 

: finished reading the story. Then he looks up and gets his 

‘om a card then being exhibited and copies it on his 

A new rate card is exhibited each five seconds. The 

ber on the card represents the number of words read per 

ite. A pupil, or some other assistant who has been trained 

the purpose, exhibits the rate cards under the supervision 
he examiner. 

\iter the pupil has recorded his rate score, he proceeds 


ediately to answer the questions without being permitted to 


4 
( 
« 
i 


read. Hach question has five answers listed underneath it 
e of h is correct. The pupil answers the que 
lecting of the answers and indicating it on his 
There e twenty questions in the tests as pul 
all ch time as he needs in answering 
I TI | Is are instructed to the effect that 
’ ish ering the questions, he is to take out a 
l until all | finished 
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Lil 


itt 


d 


lt takes about an hour to vive the preparatory an 
TI includes the checking and _ rec] 
nswers by the pupil t procedure which makes 
sary 1 the teache to nd time outside of scl] 
the 
ts of testing seventh- and eighth-qrade 
Figure 1 shows the distribution « 
ot 320 eighth-grade pupils on a scale ranging 
per minute. The graph shows that there y 
Is vy d at a rate above 500. Only 1 puy 
low as 100 words per minute. The m 
ol 


er end of the scale 


15. We see that the curve is skewed < 


2/ 
| 


l DISTRIBUTION OF 
EIGHTH-GRADE PUPILS 


RATE SCORES OI! 


* 
L re 
I 
| 
| 
th 
{ 
| | | 
| at | 
and 1 
to the low Otherwise the distributio; 
| : ds fairly closely to the normal curve of distribution 
| | 
| | 
| 
| 
| 7 | 
FIGURE 
| | 
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range of attainment is clearly in evidence 
here is really such wide vari 


m arise Sas to whether t 
r whether 


tainments of eighth-grade pupils o 
ked variability is due to inherent defects in the measur 
For instance, is the story of uniform interest 


the rate al 


girls alike, or do some of the pupils read rapi 


hey are gripped by the story while others read slowly 
he story does not appeal to them? It was the general 
those connected with the testing that the story is 
interesting to the pupils and about equally so to both 
| girls. However, a comparatively small number of 


ted after the test that they did not care for the story 


be remarked in passing that in selecting the stories 
fifth- and sixth-grade test, preliminary experiments were 
| out to aid in securing stories having a uniformly strong 


to the pupils. Six stories were selected with a view ot 


those that would be interesting to all pupils All the 
given room were asked to read six stories at on 
nd to put down afterwards the order of preference 


stories having the highest degree of uniformity of 
| were selected for the test. The rate score and the com 
on score, in each case, is the average ot the scores made 
‘what the 


two stories. This plan tends to decrease somewl 


riability in the pupils’ scores. 


6¢ as 
j 
| 
| 
a6 | 
] 
fe 


THE DISTRIBUTION OF THE COMPREHENSION SCORES 


OF 320 EIGHTH-GRADE PUPILS 


It 

| 

ns 

| 

| 

| 
RE 2 
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Figure 2 s] s the distribution oft comprehensi ns 
the Ji H S 1 Test o1 scale ranging from 
( nti | r the 20 questions The 
e toll cl ly the 1 curve of d 
l ll be noted tl no pupil made pertect score, 
| that 1 | 1] wered all 20 questions correct! 
wise, no pupil Consequently all pupils were 1 
edian is 57 } questions answered correctly 
it ( dered t this s vy is of about eighth-grade | 
results for 274 grade pupils show that it is 1 
the seventh grade als he comparative results of the 
na « hth ry des indicate that the test will also be an 
eCasuring inst ent I the ninth grade 
n deriving the medians for Table I, data from onl 
ls which gave the test in both the mid-seventh | 
the mid-eighth grade were included. This was to in 
ccural ite nd « mprenensio1 intervals between the 
nad the « ehtl des 
TABLE I. MID-YEAR, MID-GRADE MEDIANS FOR ST. 1 
(JRADE RaTeE | COMPREHENSION | No. PupiLs 
VII 10 2 27 
245 11 4 320 
The use of the dt Idnosty charts the classr om t 
the most important fact that a test of this sort should rev: 
the type of reading each pupil does. In almost any ri 
class of pupils, we find the slow accurate reader, the rapid 
ate reader, the rapid inaccurate reader, the slow ina 
reader id all of the intermediate combinations of the tw ' 
ents of rate and accuracy. The diagnostic chart is desig 
to show the teacher graphically just how her class distribut 


The chart is 
structed in such a way that on the basis of the 320 eighth-g: 
pt 


comprehension division, 


among nine rate and comprehension groups 


Ipt will be the n 


ls tested approximately in 


abo 


50 percent 


7 


ul 25 percent 


in the inferior dv 


rh 
s10n, and mut 25 he 


Ss followed with reference to the rate divisions 


percent in division labeled good 


plan 1 


| 
| 
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mporehension 
é 19 2 4115/16 119 
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EE 
Infer r (25 Medium a 5%) 
3. THE DIAGNOSTIC CHART FOR A MID-EIGHTH GR. 
CLASS IN SCHOOL A. 37 PUPILS 
Charts for three eighth-grade classes are reproduced repre 
three distinct types of classes with reference to reading 
ney. Figure 3 is the chart for a class in School A that 1s 
bove the average in rate and somewhat below the average 
prehension. The distribution indicates a small negative 
lation between rate and comprehension. The fact is that 
lass in its training had been speeded up at the expense of 
ehension. A number of these pupils should be told t 
somewhat more slowly. This point should be particularly 
hasized with the nine pupils falling in the rapid-inferior 
ision. It will be noted that in this class the two pupils rank 
x highest in comprehension are medium-rate readers. The 
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The next chart, Figure 4, reveals a situation in §$ 
that is radically diffe t than that in School A in t 
vrade. This class is considerably above the standard 
prehension (median 13.0) and considerably below 
(median 202) In the preceding chart the dots tended 
tribute toward the upper left-hand corner, while in t 
they tend to fall toward the lower right-hand cornet 


are seven pupils who are slow but good in comprehens! 


wossible that this school has stressed thoroughness 


| 

FIGURE 4 HE DIAGNOSTIC CHART FOR A MID-EIGI : 
| CLASS IN SCHOOL B. 42 PUPILS 

| hool 
| 
n 

| 


j 
f 


it appears tnat ning 1 WwW needed Vy Unis 
e expected, of course, that ¢ 1 pupil or each a3 
- itu | ‘ 111 is 1s J 
iow intelugence, 1n Order, t comprehend et 
| ] ] than the hricht 
- 17 COI preh nil tter re cit ; ter The 
L 
I qaiers ¢ Cl L iit ecaians 
| ] S Tall im l Li Cl I 
i 
4 
4 i i 4 | i j i 
e grouped according to t needs and should be a, 
ill different 
Or + 
"111 
t 7 cslent 
tains the following statement he fastest rea 3 
ce the best readers Therefore, if vou develop speed, 
e the quality OF The childs reaaimy reast 
] ] +} +} lt rol rie in 
es not ccord with tee scarcn WOTK 11 
ut 1t also fais to accord with the Ce mon experience * 
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FIGURE 5 THE DIAGNOSTIC CHART FOR A MID-EIGHTH 
| CLASS IN SCHOOL C. 38 PUPILS 


| of teachers. The data resulting from the application 
Junior High School Test show about what the St. L 
the Cleveland Surveys showed, namely that the rapid-go 
and the slow-poor pupils are about twice as numerous 
rapid-poor and the slow-good. However, this does not 
that increased speed brings improved quality. There is 
to believe that School A 


quality, while School 


has developed spe ed at the expel 
> has not given sufficient attent: 
speed. No doubt very slow readers may comprehend bett 


they read somewhat taster. Probably all such pupils sh 


encouraged to speed up and should be given types of d: 


signed to produce a wider span of recognition per eye-{ 


| 
| 
| 


Cr 
= 
¢ 
~ 
of 


caras 


ome ~ 
f 


ther hand, ny pupils have reached the point in t ic 
te that for them en the maximut 
C iV Cll SIVes Lic i CICTIC ( 
1 For them to be speeded up without a corresponding . 

ment in abil lv to lowe 
f their readit id-careless rea 5 
uld be given training that develops accuracy \ decrea : 
d will probably be advisal le i 
uid be ag uped and Ly 1 reat ins l l 

to eir neeas Lhe qgiagnostic Chart Is ¢ 
e! ng their needs on e Dasis Of each pu] 
ent in rate and Comlpreneristis Beg 
ry.—Each of the narrative reading tests has the fol i 
stinguishing characteristics 
q 
[here is a preparatory test, shorter than t real test, ie 

rried out in every detail just like the real test . 
There is a unique plan for measuring the rate involving Al 

faset of rate amie. To get his rate score all the pupil Be 
ut to do is to look up when he comes to the end of th fa 

opy the number then being exhibite: a 

comprehension test 1s upon the read which h = 
sured as to rate It consists twenty questio! 
pupil's grasp of the leading points of the story 

cts one of the five listed nswers 1! each ques rae 
simple and economical in time; it 1 a 

f the comprehension of a narrati .3 
[he tests are rn woth tine t 

ne tests are concernes Wil a sper re 
1 

il and thev measure a specific tvpe of comprehension a 

| 
Since the pupil enters His data on an MeNpe»nsive 
st material mav be used over and over again, thereby rt : 
ng to a minimum the per pupil cost of testing : 
b> nostic chart, the teacher Can Ccasily 1 

ul al VZINgE Individual ¢ ses and 
ng her pupils for more eftective instruction in accor lar 
lividual needs % 

The time required for testing a group of pupils including . 
ecking of the answers and the recordin t the scores 1s ‘ 
10 to 60 minutes This means that the teacher 1s not re 7 
to do a lot of checking and scoring outside of s 1 time 4 


SOME ESSENTIALS IN SCHOOL SUPERVISI 


] 
+1, 
e general igreement to the ] 
is f the purpose of improving class 
es \ ree dot not 
t efficiency in class work 


through either a greater quantity 
lity, an equal quantity of better quality or 


ty and quality at a lesser time expenditure. TI 


s tended to be recruited either throug! 
this type of work or through selection « 
ked success in some particular line and 
t ug 1 cht | ( lity to direct oth 
simular ccess But whatever may hav 
method of selection, there are certain « 
must be borne in mind if the work is to be satis 
se of this paper to discuss a few of thes: 
lection of the appropriate subjects by means 


desirable educational objectives is a problen 
current practice, and legal or institutional 
tten more potent than the supervisory fore: 
deal with these subjects in the schools. However, 
| nal philosophy which interprets the content of the 
terms of present day needs and probable future needs 


do mu n determining the educational ends to be secu: 


DeriniteE GOALS 


\ssuming that these general subject fields are already 


ined, it seems to me that the first and most important ess 
. > 
at the Educational Research Depart: 
; O State Teachers Association, Cedar Point, Ohio, June 28, 1922. 
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EK. J A SHBAI 
nnot he 1ust hed not maintained This increased et 
nifest 
criven qu 1 

n 
t! 
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| 
means of these subjects 
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is a rather accurate knowledge of definite goal 
The best supervisors are not content today 
als, ethereal objectives, indefinite gene 
eading in order that children may be e cult 
is not a satisfactory statement for the su 
1e it may be as an ultimate objective. The supe 
ily have an ultimate end in view but must ! 
rve number of intermediate stations definitely « 
reaching this ultimate. I like to think of these 
ils as towns or cities along a well-marked 
h is definitely marked so we know when wes 
We know how far we have come since leavir 
we know what the next one is on the rout: 
travel before arriving there. 
ing ourselves for a bit to the field already 
x, let us seek farther for acceptable objectives. O 
ve desire our children to learn to read 11 det 
ight from the printed page. This is bet 
nt of aim, but it 1s still unacceptable f it 
supervision. It is not a specific goal. ( 
to an automobile trip, the first statement (7.¢., of cultu 
tive) would be like saying that the place to go is wher 
te is pleasant, the people hospitable, and the ds 
ind. Such a place may indeed be desirabl 
is directed in his journey only by such a stat 
inly be handicapped in proceeding on his j 
nd statement, namely as to getting thought from 
ayving 


ge, might be likened in the same analogy 


ce to go is to the city in Missouri where there is satis 
yarage service at the least possible price rt 


hotel and g 
oal determined upon, it might serve to start 
general direction; but how would he know w 
tay 


The state of “thought getting” is nea 


~ 


right 


tne rig 


there: 
state of Missouri. 


rther, in going to Missouri one might take any numb 
gely at his pleasure; but in guiding the child to the d 
il in reading, one may not loiter, for the length of tin 
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OF EDUCATIONA RESEARCH 
| r guidance Is pl 
it 1s ess he reaches this goal in 
four or five s, he will be greatly handicapped 
( Hence it 1s even more 
the goal be defined definitely and that land 
ess mav then be noted and ¢ 
t the ultimate objective will bi 
late 
\ goal more specific than “somewhere in Missouri 
I S larl goal in reading more spe 
the printed p 1S ibilit 
’ rn ical material at the rate of not 
{ er minute an at the same time to get the 
I duce, by answering questions or « 
ngs percent of the ideas expressed in the sel 
definite goal which mav be set for att 
le t half ot the children bv the end of the ei’ 
/ | ed such a goal, we may look for the int 
| es which will enable us to know what prog 
d this more remote objective. We 
( t rade goals as follows 
((OMPRE- 
ADI SPEED HENSION 
percent 
l 50 50 
I] Q5 60 
ltl 125 70 
1\ 150 
\ ISO 
Vi 
Vil 220 93 
Vill 240 95 


understood, are hypothetical 
» with th 
itly different from the norms 
ertained by workers in this field 
uld be considered 


such norms 


se which she uld be establ 


J 
ifiese is, 1 Is tot 
e at considerable variance 
though they are not gre tt 

Ment whi have been as 

tentatively, Wever, shi 
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until the necessary analytical and experimental 


s been done to verify them or to establish new ones 
+ amount of such analytical and experimental work ts 
In order to set a satisfactory goal for the end of the 
se, such questions as the following need to be an- 
\t what rate and with what degree of comprehension 


-erage adult twenty-five or thirty years hence need to 


read in order to live successfully his full measure ot 


rthy member of society at that time? We must set 
| terms of the future. The school must prepare the 


lult life because it will be with those, who like 
f are now enjoving greater educational advantages than 
rents, that he must live and work. Doubtless this tuture 
t be determined bv estimate trom the ascertained needs 
esent. Even these, however, are thus far undetermined 
ther questions that will need to be answered are: What 
f overlearning must be accomplished in order that the 
sidue may continue to exist after drill days are over? 


diustment of goals should be made because of variation 


f material? What other goals are legitimate objectives 
x besides speed and comprehension? What are defimite 
in appreciation of literature, such as appropriateness of 


ividness of description, aptness of illustration, et 
to illustrate what I mean from another field, the super- 


the teaching of arithmetic cannot be done well with 


neral objectives. Here again the goals must be definite 
large number of intermediate points should be definitely 


1 


ted. Teaching children to add well is vague until how % ell 

ited in terms of rate and accuracy 1n respect to a definite 
material. Instructing a teacher to develop ability to use 
lefining the ability more 


fically leaves large leeway for the teacher either to be satis 


n and decimal fractions without « 


with too little ability or to insist upon the acquisition of 


lity far beyond any need which is likely to arise in the life of 


r child or adult. 


What has been shown for reading and arithmetic is quite as 


in the other subject-matter fields. Lack of definite goals, 


ay 

ve 

it 

nd 

ithe 
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c] ‘ ‘ s vn 
\ em as Wilat ¢ 
pupils 
4 
f HE (OBJECTIVES 


| LOWS LOLICALLV ] tile 
i i 
( \ ( Ss Ol Cl 
the te ent of t 
‘ | tten and 1 re ad 
the injustice in crus 
ti initiative pervision making the t 
el ic CTVIs Goes non 
lrue supervi n ¢ does not mean that the s 
1! ] ] ] 
telis the t ner ¢ cit ns s to her every new ste 
Is the so-called penmatr 
: 
pen teach that 1s dor 
t teaching: t music 
the music te ng L hese e not supervisors: thx 
signed t l ect in all rooms instead of t 
upon the t r and tl igh the medium of the cours t 
ts the definit Is i rms of quality and quantity 
met ve te nals in the textbook may indicate met 
t 1 er define quality of work lL 
thet t and substitute real worthwhile goals which d¢ 
Having defined the goal in the subject for the Trade 
s indicated Of the landmarks along the way. th 
SUpPervis Wil cy teacher an opportunity to exer 


~ 


utiative in attaining this goal. Detroit has been tryit 


scheme whereby the supervisor is prohibited from visitit 
classroom or making any suggestion to the teacher except 
request This not only gives the teacher an opportunity t 


i 
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LIV¢ Dut al Pia S Tes] I 1] 
| ttn tne vt ] al ne oO! l@St Hel it 
‘ vl i ¢} eat rt 1) 
re needed If she realizes she needs help to ” 
and ask for s IS In thie est 
t nm the supervisor can give and hence the t e ot 
is economized. If she does not call for help and is 
tain t 1ective et, the resp ests squarely 
indicates that either she can not judge accurately a 
t r pupils or over-estimates her own o1 
nowledge her need In either of these cases she S 
hether the responsibility for 1 
( whether tilt espe SIDILITV TOr recoenilion ot 
l upon teacher or supervisor, whenever the need i. 
visor should be able to furnish 1 lust how thi fe 
eiven mav vary in terms of the personality of 
Ci) , OF need, ete CTVISOT may 
, the ciass petore le teach wiustrating 
procedure by which the desired results may be ol : 
lly d efficiently. She may, in personal privat 
\\ 1 tHe teil leaching process s id 
ol r Give suggestions which will supplement the 1deas he 
cher has already been ut mav, either 
ce or Dy letter, direct the teacher's a tention to avatl -" 
ure in which the teacher will find specific help in het $ 
[he true supervisor may use anv or all of these means \ 
e accomplishment of the goal but in all cases she must ‘ 
fairly definitely how to attain the goal or wh thre : 
iv be Tound 
CHECKING ATTAINMENT’ 
G essential Is Ck sely as ociated with the first two ts 
rvision 1s dependent upon the supervisor's knowl] 
i 
| use of valid means for checku © the extent to which i 
forth have been attained lo set forth ends to | 
without ascertaining whether or not they have been > 
hy 
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ittained 1s folly. It is worse than folly, since it will q 

break down the effort on the part of the teachers to att 

objectives. The teacher who questions the validity of 

jective or who has ends of her own which she consid 

important will quickly cease to strive tor the former wl 

learns that no check ts de ot her attainment of the « 
question 


Probably the best reason for the use of adequate 1 


checking attainment 1s exemplified in the old adage, **} 
ceeds like success.’” Children and teacher alike wish t 

t the success of their efforts If the objective tor eight 

reading has been set at 240 words per minute with 95 


comprehension, every eighth-grade child and teacher w 
interested in knowing whether that goal has been reacl 
exceeded. Every child who has passed it will feel the 
successful effort. This will be shared by his teacher. Th 
ing of goals upon an objective basis removes the personal 
vart of those who fail. Supervisors 1 
leir time and ingenuity in devising ol 
tests for measurement of success in attaining their definite 


and they must certainly utilize those tests which are 


ortant that progress be known bef 
end of the vear than at that time. Diagnostic tests w 
only show status attained but also reveal weaknesses 
supreme value in supervisory work. To the extent that 


ties are discovered early, and to the extent that no time is | 


futile effort and no wrong habits are formed, progress 
approach maximum rat The teacher should be keenly al 


these facts and ready to use the means available to check 
items. But it is the function of supervision as it is orgat 
most systems to see that these items are cared for whetl 
teacher is alive to their importance or not. Hence the supe 
must be aware of and use these instruments for chi 
progress 

Supervision without definite goals and objective means 


checking progress is likely to be of little value \ state 


~ 


| 
4, 
J 
avall ble 
| 
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phase of the topic under discussion. Rural school 
te have practically no supervision, since One county 


lent for 150 to 225 one-room schools scattered ove 

f soo to goo square miles can not possi 
even if he had nothing else to di And he certainly has 
duties. On the other hand, all cities of 10,000 or 


nd had had for a number of years, special supervisors 


ect. A single system of penmanship instruction was 
st of these cities. But definite goals of attainment 

of specific rates of speed and qualities of form were 
jective means checking ttainiment were also 
\Vhen theretore, the I ples t writing were ca efully 
he \vres scale, thy results revealed that the median 


handwriting in all grades in city schools averaged less 

half step on the scale better than the quality of writing 

il children and in the third and fifth grades the quality 


f rural children was actually better than that of city 


~ 


nilar survev in the fundamentals of arithmetic 


n advantage for citv children of only about one-halt 
verage inter-grade advance. Such lack of significant differ 
veen the achievement of supervised and non-supervised 


points clearly to ineffective supervision 


OPPORTUNITIES FOR RESEARCH 
e three essentials call for much research work and super 
daily contact with both teachers and children are in 
rly strategic position to conduct at least certain phases 
The definite goals by subject and grade can 
1 schoolroom Supervisors mav well set the goal 


nd then utilize all available knowledge in method and 


1 


gy of learning to set new records ot accomplishment 


little that has been done has shown conclusively that the 
ress that is normally made in rate of reading in two years, 

attained in less than half that time with no loss of degree 
mprehension. Richardson’s experiments in spelling and 


etic have shown that “campaign” methods enable one to 


; handwriting in lowa a few years ago shed a little light | 

| 

\ 
| 

lT 
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eed up the process « I tlaining specific cri als very m te} 
\nalysis of social needs such as the work of Horn in spellit 
the social sciences and of Wilson in arithmetic are fundar 
to the deter ion of the proper content of our curri 


Such work as that of Charters in language errors lays the 
dation for specific application of corrective measures. | 
upervisor could add to the knowledge now available 
id secure the cooperation of her teachers and their 
in a small but definite bit of research work each year. 


This naturally suggests another essential in super 


namely the ability of the supervisor to secure the whole-h: 

cooperation of her teachers. This is secured only throug! 

demonstration of a sympathetic, helpful attitude. When 

teacher discovers that the supervisor is a sympathetic sou 

help and inspiration, her cooperation is at once guaranteed 
+1 } 


il set up by the supervisor may be new and apparent! 
inable, but she will give her utmost effort knowing tl 
supervisor will gladly lend a helping hand and will either 


her how to attain the proposed objective, or cite her to the 


ture which will ex 


The devices suggested may be 
they will receive consideration because she will know that 
supervisor 1s as vitally interested in her success as in the su 
of these boys and girls. She will also know that a me 
praise will be hers when the checking is done and when it 
that aspiring to the goals and following both her own 


initiative and the advice of her supervisor the desired 


The program suggested here is applicable in any place 
there is any supervision at all. If you are a superintendent 
small place where all supervision must be done by you, it is 


possible for you to follow it. If you feel you can do 1 


select one subject; make yourself acquainted with the literatu 


in the field so that you may set up definite goals of attain: 
which will be in line with the best that is known on the sul 


be able to cite vour teacher who needs help to books and arti 
which will aid her in attaining these goals; use the means : 


able to check results; and secure the sympathetic cooperation 


~ 

~ 
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ers in doing a bit of real research on this problem in ae 
vn schools. It means some time and effort and doubtless ge 
| that vour time and effort are about all taken now. But Ee 
t to vour school, to vour teachers, and to yourselt to 7 
it means service of the est type to pot s WOT a fs 
nd it means professional development of yourself at a 
a 
Litit. 
Mee 
& 


CAN TEACHERS SELECT BRIGHT AND DI 
PUPILS? 


sen tigat f the use of rating scales | 
ngs are t rel e measures of human t 
I sing thi ing officers in the United St 
that: “I ery improbable that an offices 
hin « n | | fifth’ f the entire scale \ 1 
While t rs dl credit teachers tings 
litie it I ¢ le that teachers can select tl 
wh represent the tw extremes of a trait This papet 
cussion of an attempt to measure the ability of teachers t 
the extremes of gene! intelligence Intelligence 
In prererencs to other traits because (I) It is the only 
wh 1¢ ecasur©res il 
to check tl nes { l¢ chers’ esti tes e oft 
the selection of bright and dull pupils to be put into spec: 
(2) If we believe that the bright children should be 
nce to progress more rapidly than the average child: 


it becomes important to know whether the teachers cat 
the use of intelligence tests, select those who might profit 
an opportunity (4) Unless teachers can select the bri 


+} 


they will not know how to advise pupils intelligently 1n 1 

tion of a \ to Unless a teacher 1s able to esti 

re the bright children, some children will be advised 
fessional occupations when their ability is not great et 


to warrant their preparation for such a life work 


From the data presented here an attempt will be m ide t 
swer the following questions (1) How reliable are t 
selections of bright and dull pupils? (2) Is it more dif 


select the bright pupils than it is to select the dt 


7 
G. F. Varner 
fwr tversily / Vi 
Rugg, Harold the rating of human character pract 
J — f Fd Peyche loau, 12:4 6, November, 1921. 
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es it become more or less difficult to select bright and 
as they become older? 

hools of St. Paul, Minnesota, the teachers select the 

the lower twenty percents of their pupils in general 

in order to supplement the intelligence test scores 

ining which pupils will be put in special classes. The 

d in this investigation was obtained from the St. Paul 

1920. It concerns six eighth-grade classes containing 


ls and three second-grade classes containing 80 pupils 


rs were given a definition of intelligence and wert 


lect the best and the poor st twenty pe rcents in thei 
[he eighth-grade pupils were tested with the Haggerty 
Fxamination, Delta 2 and the second grades wet 
th with the Haggertv Examination, Delta 1 and with 
Primer Scale. One of the second grades was not 
he Delta 1, so in this case the results were for the 


ils were translated into intelligenc: 


l the teachers selections were com] ired with the in 


tients. as well as with the actual test scores. It 
that the te ichers’ selections con] ared more fave rably 
ntelligence quotients than with the actual scores. Si 
( t the intell gence quotients are taken as all rdin 
! classification to use and the results are reported for 
ligence quotients only. Since the teachers were directed 
he highest and the lowest pupils in their grades regard 
ene ral intelligence level of their | upils, the con iT 
be made for each grade Se] arately The con parisons 
le by dividing all of the pupils of each grade into twenty 
classes according to their intelligence quotients. The 


nd dull pupils as sel cted bv the teachers could then be 
within the twenty percent class to which thev belonged 

to the test classification. The results of this compart 
shown in Table I for the eighth-grade classes and in 


for the sec nd grade classes They are summarized D\ 


in Tables III and IV 


ry 
1) 

+ 

t hee 

v test only 

Phe scores of alll the pez: 

eT 
tad AYS 

I r¢ 

% 
4 


FF THE UPPER AND LOWER TW 


RISON | I 
CENTS OF 1 PILS AS SELECTED BY THE TEACHERS 
CE TIENTS AS FOUND FROM THI 
ELLIGENCE EXAMINATION LIGHTH ¢ 
ny { 
| 
1] 

1) 
t 2 l 
l 
l 
VI 
| 9 

20 12 9 5 

i ver 20 
| t l 10 
20 
(2 25.53 19.15 10.6 
P 
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| 4 COMPARISON OF THE UPPER AND LOWER TWENTY 


S OF PUPILS AS SELECTED BY THE TEACHERS WITH 


ELLIGENCE QUOTIENTS AS FOUND FROM THE HAG- 


ELTA I AND PRESSEY INTELLIGENCE EXAMINATIONS 


GRADE. ) 
( INTELLIGE? 
First 20 2 20 | I h 20; 
l 
20) 
l 4 
r 20 
l 
r 


r 20 
erent { 9 
r 20 
ryt 99 m 99 lt 65 = 
a | r 20 
nt 17.24 23.53 5 83 52 


le I shows, for example, that of the eight pupils selected 
teacher of School I to be in the upper twenty percent 
n intelligence, four were put in that class by the intelligence 
nt classification, two were put in the second class, one was 
fourth class and one was put in the fifth or lowest 


1 
al 


he teacher to be in the 


Of the seven pupils selected by t 
twenty percent class in intelligence, five were classified 
me class by the intelligence quotients, one in the class 
nd one in the middle class. Table II gives the same data 


second-grade classes. 


a 
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a 
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\BI Il! \, COM \RISON OF THE ABILITY OF TEACH 


LECT BRI {T AND DULL PUPILS (EIGHTH GRA 


r i 
1) ] i 
\ 
Aco RISON rHE ABILITY O PREACH 
) ULL I PILS (SECOND GR 
| SAM Dit 
( \ INTELLI 
i) iW 


how the percent pupils concerning whom the teac! 
t tes agree 1 with the test scores ‘hey also show t 

OF pupils ¢ whom the teachers disagreed wit! 
results and the amount of the disagreement. Table II] 
that of tl selected by the teachers to be in the up] 

cent of all the eighth-grade classes, about 43 percent 

wise pl ced in the u per 2 ) percent or these classes 


their intelligence quotients. If we may assume that the 
ults are correct, 25.5 percent of the pupils were misrate 


eachers to the extent of one Class, 19.2 percent to the 


f two classes, etc. Table IV gives the same summary 1 
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T 
| 
| | 
PER r PupILS IN SAME oR Dirt 
\ IRDIN ro INTELLI 
(ne lwo Phree 
l | 
Lower ty per 
Tables II] and IV are summaries of Tables I and I] | 
| 
second-grade classes 


now rn to the tl questions which we sed 1 
ng of our article. From Tables III and IV we ol 
hers agreed with test results in trom 20 to 60 pr 
cas n which tl V i ssed udg ent Their dis 
_ however, were much closer for dull children than 
bright childret In the second grade about 53 
1a hth ile bout 62 percent ot the teache 
t pupil belonged in the lowest 20 percent were v« 
telligence quotuents the ther | nd, whet 
ed that pupil belonged to the best 20 percent « 
Ss, they rreed th the intelligence quotier 1 uch 
in but 22 percent of the cases in the second gt cle 


t 43 percent in the erenth grade 
7 7 
ears to be appreciably more difficult for teachers t 
mentality of puy in the second grade than in the 
] th 
lable IV when taken in comparison with Table III ex 


wh 


ers who participated in this 


s tact so tar as the tea 


ition are concerned. In round numbers 20 percent more 


i 
cht d 10 percent more of the dull ch ldren were cor 
lected bv the teachers in the eighth grades than in the 
des In another investigation by the writer it was 
the teachers’ estimates of the intelligence of 171 third 
tients while the 


ils correlated 0.43 with intelligence quotients 
s of intelligence of 89 seventh-grade pup 


th the same measure. « yrrelations substantiate the 


n to be drawn from this investigation, namely, that it 


ifficult to estimate the intelligence of pupils as they 


the extent that the results of intelligence tests may be ac- “A 
s valid and the judgments of these teachers may be re- 


s typical, the following statements may be offered : 
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SUMMARY 


1. Teachers can select 20 to 40 percent of the bright pupil 
in their grade and from 50 to 60 percent of the dull. | 
It is more difficult to select the bright pupils than 


to select the dull. 


It becomes less difficult to select the bright and the 


3 

Is as they become older. i 
1. Selection of pupils for special classes by teachers’ 

mates is a very unreliable and unsatisfactory method of 


re and should not be used if it is possible to use an int 


les 


ce test. In the selection of pupils to enter classes for 


children, teachers’ estimates of intelligence are of practically 


5. From this study of teachers’ ability to select the intel 
gent and the unintelligent children, inferences may be drawn 
to their ability to select pupils according to other character traits 
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THE MENTAL AGE OF ADULTS 


Epwarp A. LINCOL? 
Harvard U neve sity 


results of mental testing in the United States army dur 
late war left one vexing problem for the psychologist 
ta. taken at their face value, indicate that the mental age 
lults is not sixteen, as assumed by the Stanford Revisior 
Binet test, but only about thirteen and a half. This port 
me somewhat controversial, and has been recently 
ked by Doctor Terman, who states, in the Journal of I 
i) Psychology (vol. xu, no. 6, p. 330), several reasons 
he believes, invalidate the inferences which may be drawn 
the Army results. It is the purpose of this paper to inquire 
the validity of Doctor Terman’s objections to the interences 
the army results, and also to seek further light on the 
problem. 
ng the objections which Doctor Terman ofters, one ot 
st potent, if it can be shown to hold, is that “the 653 
d men! for whom the average mental age of 13.4 
cannot be taken as representative of the entire dratt 
The assumption is that these men were of inferior In 


gence as compared with the draft army as a whole sin 


ce, 
lisproportionate number of them were trom southern and 
southern camps where, according to all the results of Arm) 
ng, average intelligence is lower than in northern and 
rn states.” 

This assumption is not supported by the facts, which are 


wn in Table I. Here we find that the 653 group, when com 


rder to obtain data regarding the comparative scores of 


ns used in the Army, requests were sent out from the & 
to the psychological examiners in 12 camps that 100 in « 
unselected, be given Alpha, Beta, and the Stanford-B 
ns were given for the most part in August, 1915, a1 1 of 74s 
ed, 653 were born in English speaking countries. For full t s 
S R M.. ed., Psychological exramtnwmg wen the Uv ited Stat 
M irs of the National Academy of Sciences, V. 15, 1921) p. 354 ff. ' 


Army report’’ in this article. 
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e sa whole, has a daistine 
i b rdless of the ¢ mi 
O52 ¢ larger rtion of C ni 
t ll A, B, and C+ men, the comparison ag 
= lable I also shows that in regard to tl 
e] ( t e is little difterence 
I ( WHIT DRAFI \ oO Os 
¢ AD 
RADI 
\ 
| ( Cc B \ ( 
1) i.1 17.0 23.8 25.0) 15.2 S.0 
\ 18.1 | 8.4 19.3 21.3) 18.2) 8.9) 5.8 
6.3 21.0' 21.8) 20.8 15.9) 9.2 l 
St Bins 15.6 23.0 22.4] 11.9) 6.0 
f $ Army report for 
ure f 7, pag 
C : v also be made on the basis of the « 
cores for the two groups, which may be found or 
3 . 1 333 on pages 789 and 790 of the report. In gs] 
that the 653 group has no scores on the combined scal 
21.9 while the White Draft has 4.607 percent of its score 
t it, the smaller group has a slightly higher medi 
iS ( pared with 13.46 for the White Draft. This diffe 
persists when translation is made into mental age; th 
for the 653 is 13.25 years, as against 13.15 for the gr 
| the principal sampling. It may be fairly argued, of c 
t differences are so small that they are not significant 
ever, this evidence, though slight, 1s cumulative, and vw 
facts in the previous paragraph it seems to indicate unequi 
. that we cannot assume that the 653 group was inferior in get 
intelligence to the draft armv as a whole. 
\ furthe ion which has been raised to the use 
Army results is the assumption that “the draft army itse 
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he intelligent 
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positio 
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representative ol the 1 ic | etTween Lilt 
nd 21.” largely because it 1s lieved that deterred 
nd exemptions were, for the 1 t part, granted t 
pet r intelligence pat It 
proof of this assumption, Dut it 1s | sible to point 
ther fundamental considerations which Doctot let 
to take into account when he 1 de it 
report. on page 8 3o, presents figures concerning 
y re] pag 
ns nd deferred classitu tions \ ( I gathered 
ER OF EXEMPTION DI SSIFICA- 
Dependency 51.1 
\lier giance 13.16 
In ser 
Disability 
() tion Ss 
Relig 5 
p. 20-21) 
Liv ‘ «ft AV Cit Lilt SA l Ss, «ll i 
can. anv reasons for believing them to be composed 
i 
5 Who \ vy to anv great extent on an intelligence 
population as a whol 
one lf the exemptions and deferred classifications 
aependents Proba lv most ot t 1 were cranted to 
were married and were supporting their wives and 
l< this group superior in int lhi ence to the po ul ition 
Lill 4 ii Cilia Lil | ] ll 
( It scarceiv seems that we can make this assunllp- 
‘t would be hard indeed to prove it. Such evidence as 
is seems to point the other way Che more intelligent man 
longer at his education, and 1s thus kept from early mar- 
ver, he usually Wishes a pel pl: eon a Tela 
high level before he takes the responsibilities of fa nily 
HEE man is also more often successful in his 
k and manv of the 
ind manv of the men of this Class were in such | | 
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that they could not consistently claim exemption on 


that they needed to continue their work in civil life in 
support their dependents. On the other hand, the k 
ent man, v it bition or foresight, marries earl 
large family. Thus it seems probable that the exer 
this ground did not keep the intelligent man out of t 
rl but rather pulled him in, since it was more likeh 
was unmarried, or if married, that his family was inde 


f his continued occupation in civil life 


I.xemptions in the second largest group were 


I 
allegiance, that is, for those men who were not citizens 
United States \re these men above the average intell 
The Army report (part mm, chapter 6, page 693) 


ults of the examination of some 12,000 foreign-born 1 
were included in the draft armv because they had either: 
citizens or waived exemption. They were, as a whole, d 


inferior to the rest of the draft army, as the figures in 7] 


will show It is, of course, impossible to be sure tl 


foreign-born men who were drafted were on the sar 


ence level as those who were not, but there seems no 


7 


believe that great differences existed. The only bit of 


rABLE Ill PERCENTAGE DISTRIBUTION OF LETTER GRA 
BY FOREIGN-BORN MEN IN THE DRAFT ARMY 


2.9 


S 


FOREIGN 14.8 | 30.8 | 16.5 | 26.6 4 
Wrote Drart 


of which the writer is aware is that the conscientious 

who refused to serve in the army because they claimed n 

citizens, because they were enemy aliens, or because they 

enemy alien relatives fighting on the other side, were, as a 

of very low intelligence.* This would seem to indicate tl 

more intelligent foreigners came into the army, especially 


‘Army report, p. S02, section 2. 


k,D- D C C+ B \ 
+1 


ed were superior to the men of like education and high 


vith the fact that the men w were natu! ili ed nd 
e to the draft, were in all probability superior t 
f the foreign-born The same is likely to be true 


n who waived exem 


Ler n believes that the 1 1 CX¢ pted Ir thi 
se thev were already in the service t eda perio! 
“these vi lunteers ncluded dis] rtionate 
college graduates, college students, recent graduates 
ls h h-schi studet nd high-1 nded thes 
I cl sses vente lly 1 \ be true | 
ccepted nless accor nied caer e p 
1s a disproportionate number of college, high-school 


inded youths in the group, and sec nd, that those who 
| 


ss who waited to be dratted The declaration Ot war 


me 200,000 men in the United States Army, approx1- 


thirds of them regulars and the rest National Guards- 
id been called for service on the Mexican border 
one familiar with the type of man who joins these 


1 
} 
i 


ns will contend that, with all their virtues, these men 


any part of “the intellectual cream of the country 


up comprises about a third of those exempted because 


enlistment The ot 


i 


1ers were men who had enlisted 


call for volunteers was issued, or sometime before t 


ts were closed. Can we declare that the men in this 


came, say, from Harvard University were more 


stice } ‘ the ata lf 
i of cs T if 
s the Vil es ts ‘ t 
t Indianapolis, Re lars N il Guard 17; at Nas! 
res to 115, 121, and 15 
14 which in score of the « 
+ trial ps was 151. Mosq fleet t Brooklyn 
76 t ine] l ‘ ‘ es 
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oe 
tion. Thus in this field the 
t of evidence seem to be that if the general population 
not well re] resented by the draft army it was because the 
nt rmv were of a higher, not a lower, intelligence level ; 
the men who were not drafted 
tute 
iit 
-6 (162). Army report p. 515, Table 1, and p. 491, Table 145. 


‘ 
( 
men, the adv rs, and th 
t rm ule t men 
Vi ‘ I tiie 
\ ail 
teered 
] ‘ ] 
ne exemptions 1 maustl 
+} 
but 
e dor ot 
quite tes ! e tO eree With 
t ] nmin | on yr 
s \ It may depend | 
il or up the social, political, o1 
I Ol Cll i 1 Niel bet Ol his I 
ere ree auring the lion OF the seiective 
t exempt s were secured in this W 
these consid t ns t. | it Is 
lent Of Lie pe Sitions Of i1CTsilip U l 


easure, at least, to the length of time that a ma 
working tn his particular line. Men do not become 


“directors,” “comptrollers” and the like until they ha 


Cr cit e degre of experience | robablv, then, the 
ere exe pted in this § up were not more inteliget 


Some evidence for this conclusion appears in the Ari 
port in the chapter dealing with the intelligence ratings 
pational groups." It is here stated that, “consistent sigt 
differences in mental ability between apprentices, jourt 
and experts are not present.” In only a few of the o 
le experts score higher than the journeymen, 


journeymen higher than the apprentices, and many of t 
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ntelligent than the Harvard students and graduates 
later drafted Decidedly not without some pretty clear « 
ul 
fellow s, they were simpi\ older 
*Report of satatisties branch of the ceneral staff, I 17 
‘Army re} rt, | l ff 


an 


viated s 
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e severe 


red that 


linguisti 


re 1c! ly 
to 
n tl precet 
elligent thar 
re nd ( 
mental, and 


cale, he believes, gives lower 1 
untrained 


omplete scale, and 


in their scoring. On the 
1, 


the omitted tests were for 


c in their nature. They were « 
lat many of the men who wot 


al unfitness 


ectual cream « 


be cf nsidered as veTag 
In the first place, this group 
1 the basis of interior men 
ably bring the g » below 
ndency « mentally in 
nd und tedly some of 
were of inferior mental 
ell. For some groups of physical reject > we may 
he average entalitvy was low, su the Army 
s out the mental inferiority of venereals and men 
hookworm The sum of the evidence seems t 
exemptions in this gr left the inferior men in the 
it nd put the st lor men i the army 
ection that the low scores were caused by unfavorable 
examination is refuted, in part at least, by the 
t recently inoculated men did as_well as men whos 
was taken at a time when it was in ssible for it 
n at the time of the examination.” 
arguments by Doctor Terman deal with the form 
tanford-Binet which was given in the army, and the 
f the examiners. These he presents without evidence 


nental ages for 


examiners are 


point it should 
most part those 
mitted because 
ild be tested by 
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hich do exist are small the apprentices 
her than the experts I] what would be 
trom the Cc nsiderations ng paragraph 

ted men were no more il the beginne.®s ; = 
simply farther along in ence. 

n exempted for physical, —___| 
nnot be considered a part of the “intel lI 7! 
bt 
supposed tl 
p. 810-11, 
9, 
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Is could not do themselves JUSTICE 
r, the tests remaining were, Int 


1 
| nake better ree 


t ther than lower on the 

that such would have beet 
n tl rmvy who could not 
second point it must be 1 

( 1 n was made t 

he tests in which the examiner has 
the response For ex 

luded all the vocabulary tests, the n 

d tl d erences between abstract we 
erit king by examiners of s1 
called 1 question. It has never 

f students at the Psvcho-k.ducatio1 

duate School of Education. We hi 


jection to Doctor Terman’s 


t so much from these arguments th 


~ 


el 


he fact that he failed entirely to take 


uch arises from 


lave equalled or surpassed the 


men. TI 


mificant because they are open to none 


mination has been given to school 


1c results of these testings in 


another sour 


concerning the fact that in mat 


are raised in connection with the Stant 


and Cowde 


rv, K. M 


adults.’’ School and Society, 10: 708, Decem 
lity +) 
90 discusses the reliability of the fe 
‘ 
ust 


hich the men were 
e rather than mentality. The making-char 
imple in this connection. It is a nine-year t 
ion, but it was passed by a large percent 
the Harv ve t 
ncy in marking to be largely a matter 
ner 
ous ob arc: 
the thirteen-vear-olds perio 
| — 
Ibid., p. 408. Also, Lincoln, E. A. abbr 
mental age scale for er 1 
“The Army rej | ts 
and seems to justif) 
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valid point raised by Doctor 
be too difficult in the upper ranges 


periority of thirteen-vear-olds over army men ap 
hen the Examination a, the original group ex: nation 
rmy, was given to groups of California school chil 
is t be re el bered that the scores ot all children 
r of foreign-born parentage were eliminated from 
n. but to offset this, in some measure at least 
t the scores for the enlisted men are t only thos 
re considered literate. Care must also be taken not to 
the error of considering the enlisted men as a group 
red low partly because it included I officers since 
p were afterwards sent to officers 


the men in this grou 


Is and commissioned."! 


Army report the following results of 


rded: 


Median score 399 12-year-old children, grades 3-9 1558 
274 14-vear-old 207" 
56.140 enlisted men, literate, from 
Camps Devens, Dix, Lee, and Taylor. 161° 
Army re rt, Tat 
491 Tat 14 


se ficures seem to indicate that, even after considerable 


ve 


e for all sorts of factors, the army median cannot 


dered to equal or exceed that of the thirteen-year-old 


are not in the Army report so far as the writer 1s 
e, any results of the scores made on Alpha by school chil- 
[his examination, however, has been used in a number ot 
Is since its publication, and the results have given further 
nce that the mental age of adults is on a lower level than 
d supposed before the army 

The median Alpha score for 67,254 literate white men 


tests gave us a hint on the 


loctor Terman falls into this error when he states that all officers were 


from the 653 group. 
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Lil 
and we need not even take account of the one unques 
n¢ 
In the fxamination a 
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60.5,'* and the median for 51,620 native-born whites 


Doctor Doll examined with Alpha 514 children fri 


les of the Franklin School in Trenton, N. J 


Mzpman S 37 «61 85 80 
Nom 29 iS 63 % 139 7 
| this s 1 the veal lds equ | the 
Fain it sé 1 h many allowances thi 
not t ist be re embere 
tior e above figures t the medians for the 
tedly low, because number of the m 
twelve- and thirteen-vear-olds have gone on into the 
! t of the fifteen-vear-olds and many of the 
vear-olds left in the grades are mentally inferior. 
\ p of rural New York children selected 
f 1 grade tl wh | ) school and ranging 11 
Ve t] ll Vil COTES 


9 10 11 12 13 14 15 
ALPHA SCORI 35 1} SI St} O5 
This vives us a more accurate idea ( f the scores 
upper ages, and again indicates that the average soldie: 


tainly not much above 13 vears old 


If we turn to high-school results, we find the foll 
ported by Sylvester and Dobbs."* 
ADISON RocKFORD S10 
Cha NI (\verage Median Average | Median 
Score Age Score Age Score 
121.2 i7 123.8 17.5 137.6 
Jur 1992 4 16.6 119 7 16.4 130.2 
Sopl 109 7 15.8 107 0 15.3 122 1 
Fres 14.8 0 14.5 107.9 
4 Tal ] ible contains a la 
I l ‘ 
uy} » 
‘Dp i \ The A gy M al Age fA ts J 
7-28 Dece 1910 
I t t use f the Army Alpha 
Sc} is 13: 27-30 January 1, 1921 
Mads I. N., and Sylvester, R. H High school students 
utings ac ng to tl Arr Alp! a test School and Socret 
October 4, 1919 


follows :'* 
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} 


st not forget in connection with these figures that 


| pupils are a select group, but it does not seem that 

ne can account for the fact that the scores of the 

of the sixteen-vear- 

in 100 percent above the average score of the arm 

e contribution in this field has been made by Kohs 
They report a set of Alpha norms based upon the 

n of several thousand California school children. Of 


thev sav. “The children tested were drawn trom al 


nmunities. and represented every variety of social 
t part of the table which presents scores for twelve- 


12-4 13-3 
55 12-8 13-6 
ti 13-0 14-0 
H5 14-3 
70 13—S8 14-6 
50.1 Median of 653 group 
58.9 : 51,620 native whites 
6005 67,254 literate whites 
esults indicate t tl rmvy mental age norms were 
1 ind that th verage man may have been slightly 


evidence piles up. The writer, however, does not wish 
through this article, his conviction that we should, 


urther investigation and analysis, adopt 13 or 14 in 


rth the fallacies in the arguments now generally ad- 

nvalidate the army results, in order that the question 
pt open and studied scientifically. If we are to use the 
ready in hand, the crux of the matter lies in the answer 


uestion: To what extent was the army typical of the 


hy | 
Tl 
rteen-vear-olds is here reproduced: 
MeNnTAL AGE Norms 
Kohs and 
Proctor Army 
+ 
irteen mentally 
nve' 
| of 16 as the average mental age for adults. His purpose 
tor, W. M.. ‘*Psychological tests and quidance of high-school 
Journal of Educational Research Monograph No. 1, June 1921. 
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population at large? An analysis of some of the evider 

heen presented in this article Much further light car 

tained b the udy of the psvchologi | records I \ 1 

the National Ri rch Council. The study might 

with thre nation of the records th men wh 

twenty-one or twenty-two vears old at the time of their 

to the army In this group the exemption for mart 

occupation would p1 bly be low, and we would be rid 

troublesome factor. The examining of school children s| 

continue, with care that the examinations are given far e1 

down in the grades, and far enough up into the high scl 

nclude all the children of ages 12, 12, 14, and 156, tl 

1! iny of selection 


ATTEMPTS AT TEST VALIDATION 


urement in education has passed its infancy. We n 
d to plead for recognition of the cardinal importance 
exact indices of the quality which we are describing 

tention has been diverted from the question of whether 


d ne, 


to the question of how it shall best be done 
ising popularity of tests promises an early verdict 


school men of America regarding the practical conse- 


nd immediate advantages of such measurements. This 
will be transmitted to us in terms of the tests that are 


in terms of the tests that could have been used. It 1s, 


| 


e, of utmost importance that the tests be more carefully 
ucted and fairly evaluated. 

inistration is not scientific in its choice of tests. Geo- 

preferences, advertisement, and the authority and 


he author are prominent elements in evaluation 


i 


trative exigencte s, such as the price of a test, the time 
sive it. and the convenience of its scoring facilities, 
ide a sale for tests whether they have other virtues or 
Such considerations should, of course, be entirely second- 
e price, time, and convenience can only be of value if 
st renders the service that is needed; and exact statistical 
must form the basis of sound judgment. Cost should 
nsidered only in relation to value. 
hief among the fundamental virtues which determine the 
test are, first, that the test shall measure what it pur- 
to measure, and second, that it shall be reliable. It 1s very 
int that a test have objectivity, that it be scaled, that it 
usable standards, and that it exhibit certain other second- 


iracteristics: but if it is to be a test at all, it is essential 
t tell the story which it pretends to tell, and that it tell this 
story upon similar, successive occasions 
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\n attempt is here made to determine in respect t 
the degree to which our so-called 1 
t they purport to measure. In this « 
Lile ¢ ect ON Tk rel 


Scriptive a feature of the criterion 
n t e it iscluines the test 
} \ ts to predict 
im fit nd value 1s judged upon 
+} ] ] 
he « 1¢ test given in the kindergarten 
racic sil be Ca ed \ Nindel I 
G rather than “An Intelliget 
Cl cil cuit (i ( what Cit Ls 
1 1 validating our “intelligence tests It is quite \ 
ie ot possibilities that the various authors ot tes 
radically in their conception of what minutiae of beh 
ine to 1 the power of adaptation which by their 
tempting to measure. It is, therefore, very « 


sible in various ways, but a wholesale deter 
relative value means the choice of some easily obtai 


tion and te clit description of children is imp1 
The use of an individual Binet test is not essential f 
e since tl erage of a dozen group tests would 1 
bably be as eood at isure of the intelligence of cl 
the results of a Binet test The unreliabilitv of marks 


et 


ents of the intelligence of child: 


1 
teachers, and ot tne mde 


teachers, makes these unsatisfactory media for comparisor 


used negatively, though it is an indication of how long 
taken the children to get as far as thev are, is insufficient, 
children are promoted for reasons other than readiness 
on. The use of the totals of all the tests is to some 
reasoning in a circle, though nearly everyone would grat 


er: ter “intelligence.” We might well substitute 
choose criteria which will tell us how nearly the tests d 
ob \ more intricate and refined evaluation of any o1 

cit 


nte ligence 


ise of these elements as 


nner the efficacy of the tes 
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rf twel\ e or fitteen tests ot power ot ” idapt ition to 
is a fairly accurate indication of what is ust 
We should be willing at least to admit 


which correlates | sitively and high with the total 
r of tests is a better instrument than one which corre 
postively and low or negatively—though this might 
test in question measured something differen 
this “something difterent 


do not know what 
n almost useless reservation. The intercorrelations 


lations of a test with the t tal, irrespective 


corre 
s. is another factor which contributes to evaluation 

use of partials would logically follow the determina 
ery cceptabie criterion so that we might then seiect 
ot tests to do the work which we e detined 


ntable criterion. 

criteria will establish in a 
ts. In this investigation we 
ned fourteen intelligence tests with respect to such 
I indicates certain correlation data. The spread 


i 
is not great; largely because the material concerns 
on school class Phe erage age otf the hity-seven 
took all these tests was 168.82 months, and the 
deviation of their ages was 10.73 months. This point 
the variation in age—is important. We do 1 
to find how well a test distin hes a1 1 tes 
ma normal thirteen-vear-old as to find « how well 
listinguish a stupid thirteen-vear-old from a_ bright 
ear-old We need an instrument which will d 
neural quality when neural change due to age 1s constant 
f the criteria above mentioned have been used here 
f their insufficiency in order to make crude distin 
veen our better and our worse media of measurement 
rence, leaving for a more detailed and intimate investi 
he discrimination between the better ones, and the de- 


on of what it is that the others test. These data are at 
ere indication of even this crude distinction As | 
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‘ria. As to the method of obtaining the teachers’ judg 
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ready stated, there are only fifty-seven cases involved, 
umber of correlations used, the interpretations of which 
nt each other, makes this a less serious objection We 
i with a whole freshman class in one high school, about 
ired, but before the administration of all fourteen tests 
pleted the number taking all examinations was reduced 
seven. 
| gives the correlation of each test’ with the total 
by ten for ease of statistical computation), and the 
rrelations of all the tests. The total was obtained by 
the scores of all of the tests except the WW ylie Scores 
test had not yet been tabulated when the total was ob- 
Each test was roughly equated with the others in vart- 
before the addition. Table I includes for each test the 
deviation, mean, and coefficient of variability (stand 
iation divided by mean). 
Je 11 gives the correlations of each test with the total, 
intercorrelations of all tests, with reading ability, as 
1 by Thorndike Alpha-2, rendered constant 
le IIL gives the average of the correlations of each test 
he other thirteen. This table is a rough check on the first 
of Table I. 
ble V gives the intercorrelations of the elements used as 
received the first semester in high school. Column 4 0! 


same table gives the correlations with the average ot 


rs’ judgments of the child’s power to adapt himself to 
] 


nvironment. These judgments were rendered after at least 


a semester’s acquaintance with the children. Column 6 
able IV gives the correlations with age at high school 
nce, or the length of time it took the children to get to high 

Naturally these correlations are negative and the highe 
‘gative “r” is, the better the test is in that respect 


11 


le V gives the intercorrelations of the elements used as 


Dearborn examination for higher grades was divided into tw 
s since it was twice as long as an ordinary test. The first half was 
alled Dearborn-1 and the second half was called Dearborn-2. 


| the 


SITAM “ET 

LAO al 
SIONITIT 

‘OT 
“6 

~TVNOLLVN, ‘8 
SLL() 


V~IVNOLLVN, 
re 
I 


YALAWILNA WW 


150 JOURNAL OF EDUCATIONAL RESEARCH j 
ont 
- 
ONS 
ROSOS 
= SOMOS OO & 
om 
y | CODCOD 
| 
» 
> 


ATTEMPTS AT TEST VALIDATION 151 


the ability of children to adapt themselves to 


ment the following instructions* to the teachers 


‘ 
i rey ) 
‘ 
Wr tt u t 
\\ 
‘ ‘ 
\V ‘ 
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( 
\ score 
} 
‘ \ 
10) V 1 
t brig ect 1 
ihe tests show a very good nk with resp Lo 
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eat except ne | e Il, which means that it 1s 
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ourse, not inclu led The Otis, the H rarerty, 


it Is ( 
| the National tests give a fairly good account of themselv 
timeter has the highest correlation with the totals « f 
tests, when reading is rendered const: 
in saving that if we wish to use a non-verbal t 


uld use the Mentimeter. The Wylie stands very high 1 


bles, in consideration of the fact that it represents onl 
minutes of testing. Its high negative correlation wit! 
dicates that it measures position within an age group better 


instructions were prepared by Mr. A. W. Merrill, the principal 


if 

al 
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than growth itself. The presence of this desirable featur 


the Wylie is verified by other data showing a wide spread 


age groups. The standing in the tables of the Myers an 


two Dearborn tests indicates the comparative inferiority 


fined. The two Pressey 


tests in terms of criteria as here de 


vary too much in the different tables for us to risk a jud 


Table II can be usefully employed to give indications 


ree to which the prediction of a test 1s valuable, apart 


the prediction which is rendered by reading. It shows 


ciation of the remnants when agreements in terms of 1 


~ 


are “ironed out.” We are most interested in that kind 


diction, since school marks and teachers’ judgments of int 
rence are based largely on ability to read, and we want 
those who are capable but who cannot read. The Th 
Reading test and the Illinois General Intelligence test cor 
highest with teachers’ judgments because they both involve 
ing to such a great extent. The Illinois General corre! 


high with the reading test that we may say that it is a me 


Ol reading 


The intercorrelations as they stand in Table I and 
stand in Table II, with the reading partialled out, are valua 
determine which tests to use together. Unless it is met 


more reliable result which we are seeking, we should n 


tests together which are highly correlated. If the corre! 


presented in Table I are valid, Otis and National-A shou ' 
be used together. Terman, Thorndike Reading, and Cross 1 
or Wylie would be a good team, since each is of some ; 


according to these criteria and their intercorrelations are 
It is not out of place here, to add that the size of the « 
lations with these criterion elements is no reflection on tl 
but rather on the elements themselves. Even if we wet 
admit that the school marks are a desired goal and even 11 
marks were reliable enough to yield a high correlation 
should still be rendering a better prediction of this kind 
success by our best intelligence tests than expert corn 


render of the yield of corn when they judge it at our Iowa 
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AVERAGE OF CORRELATION WITH THE OTHER 13 TESTS 


[ERMAN 


NATIONAL-A 


aj aj oj oj 


HAGGERTY 
[ILLINOIS 2 
l 
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CORRELATION OF EACH TEST WITH CERTAIN CRITERIA 


AVERAGE 
OF TEACHERS 
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INTERCORRELATIONS OF THE ELEMENTS USED AS 
CRITERIA AGE IS USED NEGATIVELY 


MARKS |} JUDGMENTS Pora.s 
MENTS 82 
I Al 34 58 
tS $0) 


MARK: 
oO} RKS 
fARK AG! 
rHE FIRST 
POWER TO 
SEMESTER 
ADAP 
Rank Rant Ran 
. VEY 22 6 
M 12 14 23 14 14 13 


rrelations of totals with the 


indicate the limi 


mpie, since the avera; 
ly o.s8 with the sun 


with the judg 


ria is indicated by 


ike their estimates of the product attained in their « 
they were like actual manipulation of new mater! 
children. Besides this, as the teachers’ judgments 
ment in these correlations and their reliability 1s ver 
( t expect very high rs. A correlation coefhcient 
by the relial ilities of its elements. The 
nv test with others cannot go higher than the squ 
correlation with itselt 
| hy tahl he "11 tint 
nes t es tt pt inswe three questl 
crud rik 


Reading. Illinois. the Nationals, Otis, Haggerty, 
cheap and short test is wanted, the Wylie Opy 
(2) Which test are best to use if intelligence, irre 


reading, is the objective? 


The answer of these data is Mentimeter, 71 
(3) Which tests should be used together and which 


The answer is to be found in Tables I and I] 


se data provided by H. A. Wallace of Des Moines 
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lhe correlations of four judges’ scores of ears of cor 
vield Ot those cars were all about 0.20.” 

The inter Soothe: 
of the criteria given in Table \ ER tatio: 
other elements For ex: e of t 
judgments correlates only 13 1 
is undoubtedly something wrong ents 
their insufficiency as validation crite ii: || 
correlation with the pupils’ marks The marks ar 
[Q21 The judgements are of December, 1921. The 
os And still the judgments correlate only 0.58 
totals ot 13 tests This means that the teachers’ est 
intelligence as defined in the instructions were very mu 

t) Which tests are most likely to be good for all-arour 
The answer obtained from these data is Terman, 
| 
hese 
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extensivc and more intensive investugations OL Uls 


1 


vill yield us fairly accurate determinations ot the rela- 


f intelligence tests. In addition we must caretully 
the quality of the quantitative messages that our 

transmit. Here our methods are more clearly « 
s even p ssible in some cases to dec e bv at VSIS 

measures some definite quality thougn limits ol] 
thod of validation are very restricted \rithmetu 


test arithmetic on the face of it, though it 1s instructive 
re Woody. Woodv-McCall, and Courtis; and then 


mind their differences, to hear t d prit 
k of arithmetic scores, not “Cou cores Of 


of our well-known reading tests correlated as follows 


Baal 
of 40 children 


ugh much more work with these tests 1s necessary to 
ne their true correlation, it is obvious from the data at 
we need to investigate our reading tests. In a recent 


1 


f the Journal of Educational Research Courtis says 


ld who makes a high score in my test and a low score in 
need to be given a series ot lite « | +} 

expe es. To call Met o t 
Ss denci can be made u 

true 


¢ 


not know whether this criticism is meant to extend 
rndike Alpha-2 or not. The type of the test is exactly 


( But 1 do know that a child’s deficiency 1n_ th 
2 can be made up by practice in reading. In an invest! 
’ it Garden City, New York, the c rrelations and difter- 
between Intelligence Quotients and Reading COuotients 

are correlations obtained by Miss Minnie Thurber of the Pul 
Los Angeles, California. 


tis, S. A. ‘*The analysis of reading ability.’’ J¢ f Educa 


ch, 4: 292, November, 1921. 
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divided by Chronological 


e of Binet scores and 


raining in reading, 1 


important way of validating a 


the test with mental ability ap 
subject is given? If so, the product test 


1ate Woody-MeCall  Arit! 


used are adequat 
and Stanford Revision Binet do te 
this standard. 
May Ayres Burgess in the 20th Yearbook of 


Society for the Study of I:ducation says 


Since TB is the usual abbrey mn for tu 
tw itiais losely cont ru is to tl 1S celebrated emblem \ 
Tuberculosis Association In doing so, be caretul t 
will result the separation of the initials by 
n ween tne 
Phird-g children, Italians, on the lower « 
York City, w were ¢ 1 by their teacher as except 
re y 1 we not ther elves able to use Eng! sn i 
ragray t, and finally with shi ig eves 
[ ; ! ed 1 ¢ e side ind B on the the 
vded against the « 1 cross. They could not have 


but they could discover what the paragraph meat 


ign tir t it out ihe same result occurres 
over, with I ceria eve sucn as, u I 
‘ + ¢ ten? 
oT t i tne chnildl re peti e an 
t I il ate anterior to the buccal orific: 

may procure aliment 


But when such a picture as this is put before third 
children without any reading matter and the examiner 
“Do something to the picture,” many of the childret 


Savs, 
just this 

If Picture Supplement-1 is given orally (that is, if 
pictures are put before the children and the instructior 


lly), and if Picture Supplement-2 is afterwards gi 


according to instructions, except that plenty of time ts 
“Burgess, May Ayres. ‘‘Controlling factors in the measurement 
rea ng fut) }earbook of Nattor al Socwtly for Study of 
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ndence of the result 
sixth-grade children get very nearly per- 


resm 


both ways; 
in both ways of giving. Then 


res with little spread 1 
has not much to do with it. In the lower grades, it 1s 


following directions, and in the upper grades 


eed of doing what one is asked to do, orally or in 
Kate Kelley, the primary supervisor ot Des Moines, 
constructing a reading test for I-B children, in which 
step in the process of test building is the administra 
h proposed element orally, and the inclusion of only 
practically all children get correct wi 0? 
ll the difficulty is reading difficulty It is impossible 
which elements will fall low when given orally. Who 
‘edict that when orally presented about 90 percent ol 
de children would sav that “The sun sets in the morn 
is true, or that “Boys and girls never sing’ is true Still 
reliable determination, whereas seemingly | ler sen 
re correctly marked by about 90 percent of first-grade 
] 


hen all sentences included are correctly marked by at least 


ent of the children when given orally and from 50 to 100 
he d led 


when given in print, then it is fair to attribute t 
method of 


to reading. This is a second possible 1 
ssey’ has suggested that we use the lack of over 
listinct groups as a measure of validity. For example, 
National-A is given to all children in high school, who 


lowest marks given, 


two or more of the 


; eived two or more of the highest marks given, the over 
ing 1s small. Only one or two cases out of 100 of the 
narked high, fall below the average score of the gt 
ed low. and only one or two out of 100 of the group 
ced iOW, fall above the average score Ol the eTouU nN irked 
Pressey, S. L. ‘‘Suggestions looking toward a fundamental revis f 
statistical procedure, as applied to tests.’’ Psycholooical Rev 
‘72 November, 1920 
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monstrates why 


es judge this readiness 
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elations with other tests is always 
| whenever a criterion built up 
n and repeated individual examinati 
he preferable technique. 


in era of selection of tests 


uring achievement in 
uch tests. 
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third p ble 1 ess f evaluation. 

hers’ judgment may be Hii 3111 
! I eainst which to correlate test mar] 
eXa . partial validation of the Park-Franzen t 
readi first-grade work, we had each of the fifty ] 
garten teachers at Des Moi of | 
dren; and we then 

iude 
btained fittv correl: 

cerned a series of 
ulgments; and, as the best answer tcl \| 
ranked them according to her esti it, 
respect. If the corrcla 
of best judgment and lowest for the cla! 
poorest judgment—in other words if the sami sh 
high correlation with the teachers’ ii H—thi 
would be a tf rable indication for the test. [Har \ 
if there 1s a direct i 
according to her supervisor, rrelat 
her judgments wih 
test 
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caeterminatiol 
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sible, this is t 
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Editorials 


rld of force and energy, there is a law that to every 
n equal and opposite reaction. Students of his 
| | 
ted out that a similar law seems to operate in 
f men and institutions 
ts and animals and in man himself, growth 1s 
m nor continuous Periods of rapid change are 
latively longer periods in which very little growth 
it with n life and s cial read 
In the few short vears of the world war more new 
ead broadcast among all peoples than in any pre 
n rece rded history It 1s inevitable that the re 
1 ] 11 1 
world to these ideas should be correspond 
nd it will be quite ne | if, wnen the s | ordel 
zed on the new level, there should follow a 
peace and prosperity 
ever, NaS ore s meanin 
ned in human history that changes which have cost 
( { and blood have been completely wiped out V 
i I 
which followed so that the old order was not only 
| but reinforced, and the coming of progress corre 
| 
delaved Ni every reve li on against a 1 imnical 
t has resulted in ereater rreedo! for the revolutior 
1 ‘ | 
than one king has been able to stage a “come back 
ny minor details of progress ‘lore than once the 
mental trial of a new procedure has lead to a reaction 
+1 - + 
vestige of the new idea tor at least 


REACTION 
ration 
It is one of the outstanding characteristics of the times in 
live that the rate of progress has been enorm« usly 
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reased, and at the same time the periods of reacti 
correspondingly reduced. Today change, progress is al 


tinuous The most radical inventions, the most far 


changes hold the public mind only for a few short d 
“1 points are quickly realized and they themselves 
corporated into our daily living. Such reaction 
large regret by the olde ene tion for the pass! 
phase of social custom made superfluous by the new 
tior me new cust n which they do tr sh 
inst the telephone thinkabl Is it prob 
Ol utor iles will s e day be ft bidden na t 
the h drawn vehicle restored How long will t 
nd wireless remain mere marvels? Even now tl 


put to commercial use, although such use involves 


cial relations and will result in social readjustn 


found that a few generations ago thev could not | 
brought about except by revolution and bloodshed 


The secret of the assimilative powers ¢ f the preset 


tion is education. The invention of printing mad 
education possible. Education in turn made p 


appeal to individual thought and reason. Today publ 


each new invention, for each social change accompli 
few short days what formerly it required several get 
bring about Where there is an efficient “selling ( 


which reaches all the individuals involved, there is 
bility of reaction 

For education itself this means that progress is 
upon the organization and maintenance of effective 
training for teachers in service and equally effectiv 
letting parents know the reasons for the changes made. | 


in social life compel changes in subject-matter an 


of teaching; and the education which the school 
not permitted to lag far behind the education need 
social life of the dav Society cannot wait for the sl 


cesses Of natural evolution to replace old teachers with 


On the other hand, to make changes which are not 


ier the teaching force or the public is to invite 


lustration will prove the point Twelve vears 
labl nstrument for the measurement 
| ducts was invented len vears ago standard 
d for the first time 11 rreat citv surve\ Now 
les o education what a ruler is to a carpent 
physici n Thei has beet 
evond all question. The contributions to education 
-urement have been many and tf reaching Phe 
toes not measure, who does not base her tl inking 
tained 1 screntihe experimentat 
lation to the one who does that the old country 
ears to the modern diagnostician Foday the 
e being graduated from our teacher-training 1 
( not even conscious that hey ire measul 
s | ttegral part of teaching and 1s 
( sing t 1 pres ly the cce] the 
1 the utol obile 
bout thr teachers 1n service d the general | I 
lings of reaction are heard trom time to time nd 
delaved. Fortunately, verv fortunately, the teacher: 
have realized the situation and have cr wded into 
ls and regular sessions of our universities ind col 
themselves the newer ideas ind 1 etl “1s 
e children carry word from school to home nd serve 
parents. Fortunately, also, the children have 
ests and realized their valu Thus it has come it 
( ve has been successfully effected in educatior 
nger of disastrous reaction voided 
ool horities can alwavs hope to be equally f 


St ¢ e to see that there can be no stable 1 ress, 1 
f blighting reaction without deliberate, well-plant 


mutual understanding between the 


d adequate provision in time and opportunity for t 


ining of the teachers in service. 
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